900



TAATATTTCCTTTTTCTTTTATTGCGTTACCTGTTCACCAC (41) 
YKGKRK*RNGQVV



AAGCTGTGAACTCAGGTGTGCACAATTATCAGGAACACCCCAAAACCAAAGTGAGGTAGA 



TTCGACACTTGAGTCCACACGTGTTAATAGTCCTTGTGGGGTTTTGGTTTCACTCCATCT (101)
KL*TQVCTIIRNTPKPK*GR 



AATAGCATGAGAAGCCGTGTTTGATGTTAATTAATT
961 + + + 996

TTATCGTACTCTTCGGCACAAACTACAATTAATTAA (137)
NSMRSRV*C*LI 



The inversion sequence of the apo-dystrophin-4 cDNA (SEQ ID NO: 1) 



Figure 1 
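The sequence figures pair each top-strand row with its complementary strand and a one-letter translation in which * marks a stop codon. The following is a minimal sketch of how one row can be checked, assuming the standard genetic code and a reading frame that starts at the first base of the row; the helper names `complement` and `translate` are illustrative and are not part of the source.

```python
# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(seq: str) -> str:
    """Return the base-by-base complement (same left-to-right order)."""
    return seq.translate(_COMPLEMENT)


def translate(seq: str, frame: int = 0) -> str:
    """Translate complete codons starting at offset `frame`; '*' is a stop."""
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# Row 901-960 of the figures, top and bottom strands as printed.
TOP = "AAGCTGTGAACTCAGGTGTGCACAATTATCAGGAACACCCCAAAACCAAAGTGAGGTAGA"
BOTTOM = "TTCGACACTTGAGTCCACACGTGTTAATAGTCCTTGTGGGGTTTTGGTTTCACTCCATCT"

print(complement(TOP) == BOTTOM)
print(translate(TOP))
```

The same two functions can be applied to any other row to spot transcription errors in either strand or in the translation line.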



P 1 8 2002 

TECH CENTER 1600/2900



Inversion start site 
I 

TAAAGAAAGAATTATAAAGGAAAAAGAAAATAACGCAATGGACAAGTGGTG

850 + + + + + + 900 

ATTTCTTTCTTAATATTTCCTTTTTCTTTTATTGCGTTACCTGTTCACCAC (51) 
*RKNYKGKRK*RNGQVV 



AAGCTGTGAACTCAGGTGTGCACAATTATCAGGAACACCCCAAAACCAAAGTGAGGTAGA 

901 + + + + + + 960 

TTCGACACTTGAGTCCACACGTGTTAATAGTCCTTGTGGGGTTTTGGTTTCACTCCATCT ( 111 ) 
KL*TQVCTIIRNTPKPK*GR 



AATAGCATGAGAAGCCGTGTTTGATGTT AATTAA TT 

961 + + + 996 

TTATCGTACTCTTCGGCACAAACTACAATTAATTAA (147)
NSMRSRV*C*LI 



The inversion sequence of the apo-dystrophin-4 cDNA plus a 10 base-pair region 5'
to the start of the inversion sequence (SEQ ID NO: 1A, included in SEQ ID NO: 2).



Figure 1A




AACAATGGCAG 

+ + 720 

TTGTTACCGTC (11) 
Q W Q 



GTTTTACACGTCTATGCAATTGTACAAAAAAGTTATAAGAAAACTACATGTAAAATCTTG

721 + + + + + + 780 

CAAAATGTGCAGATACGTTAACATGTTTTTTCAATATTCTTTTGATGTACATTTTAGAAC (71) 
VLHVYAIVQKSYKKTTCKIL 



ATAGCTAAATAACTTGCCATTTCTTTATATGGAACGCATTTTGGGTTGTTTAAAAATTTA 

781 + + + + + + 840 

TATCGATTTATTGAACGGTAAAGAAATATACCTTGCGTAAAACCCAACAAATTTTTAAAT (131)
IAK*LAISLYGTHFGLFKNL
inversion start site 
I 

TAACAGTTATAAAGAAAGAATTATAAAGGAAAAAGAAAATAACGCAATGGACAAGTGGTG

ATTGTCAATATTTCTTTCTTAATATTTCCTTTTTCTTTTATTGCGTTACCTGTTCACCAC (191) 
*QL*RKNYKGKRK*RNGQVV 



AAGCTGTGAACTCAGGTGTGCACAATTATCAGGAACACCCCAAAACCAAAGTGAGGTAGA 

901 + + + + + + 960 

TTCGACACTTGAGTCCACACGTGTTAATAGTCCTTGTGGGGTTTTGGTTTCACTCCATCT (251) 
KL*TQVCTIIRNTPKPK*GR 



AATAGCATGAGAAGCCGTGTTTGATGTT AATTAA TT 

961 + + + 996 

TTATCGTACTCTTCGGCACAAACTACAATTAATTAA (287)
NSMRSRV*C*LI 



The inversion sequence of the apo-dystrophin-4 cDNA plus the upstream 150 bp from the
start of the inversion at 860 to the Hpa I enzyme site
(SEQ ID NO: 1B, included in SEQ ID NO: 2)



Figure 1B



GTGGTTTGATTGATAGTAAAAAAAATGTTCGTTAATACAAGTAGAGAGTAAGTAATCAAT 

CACCAAACTAACTATCATTTTTTTTACAAGCAATTATGTTCATCTCTCATTCATTAGTTA 
VV*LIVKKMFVNTSRE*VIN 

CAATCACTCATAGCCAAGGTGGAAAAGATGTATCCCATCATGGAATATTCCTGTTCTGAT 

GTTAGTGAGTATCGGTTCCACCTTTTCTACATAGGGTAGTACCTTATAAGGACAAGACTA 
QSLIAKVEKMYPIMEYSCSD 

AGAAATCTTGTGCTTATCTATGGAATTCTTTTGATATATATTTACATTGGGAACCTGAAT 

TCTTTAGAACACGAATAGATACCTTAAGAAAACTATATATAAATGTAACCCTTGGACTTA 
RNLVLIYGILLIYIYIGNLN 

GTAGCTTGACATTTTTCCATGTAAACACCAGTAGCCTGATCCAACATTAAGCTGATACTA 

CATCGAACTGTAAAAAGGTACATTTGTGGTCATCGGACTAGGTTGTAATTCGACTATGAT 
VA*HFSM*TPVA*SNIKLIL 

ACAAACAACGTGTAATGGCTTCATTAATAAGGCTTTGCTTCTTCCTGGAAACTGGTGAAA 

241 + + + + + + 300 

TGTTTGTTGCACATTACCGAAGTAATTATTCCGAAACGAAGAAGGACCTTTGACCACTTT 
TNNV*WLH**GFASSWKLVK

AATCAAACCTTGTTGTGTACACCCTCGATGCAGCTTCTGTGTTGTCTTCACCCAGAAATG 

301 + + + + + + 360 

TTAGTTTGGAACAACACATGTGGGAGCTACGTCGAAGACACAACAGAAGTGGGTCTTTAC 
NQTLLCTPSMQLLCCLHPEM 




The polynucleotide sequence of apo-dystrophin-4 (SEQ ID NO: 2) 



Figure 2 



GGGAATGATTTCCCAAATGGCAAAGAAACAGAGTGATGCTATCTATCTGCACCTTTTGTA 

CCCTTACTAAAGGGTTTACCGTTTCTTTGTCTCACTACGATAGATAGACGTGGAAAACAT 
GNDFPNGKETE*CYLSAPFV 

AAGTCTGTCTTTCTTTCTCTTTGTTTTCCAGGACACAATGTAGGAAGTCTTTTCCACATG 

421 + + + + + + 480

TTCAGACAGAAAGAAAGAGAAACAAAAGGTCCTGTGTTACATCCTTCAGAAAAGGTGTAC 
KSVFLSLCFPGHNVGSLFHM

GCAGATGATTTGGGCAGAGCGATGGAGTCCTTAGTATCAGTCATGACAGATGAAGAAGGA 

481 + + + + + + 540 

CGTCTACTAAACCCGTCTCGCTACCTCAGGAATCATAGTCAGTACTGTCTACTTCTTCCT 
ADDLGRAMESLVSVMTDEEG 

GCAGAATAAATGTTTTACAACTCCTGATTCCCGCATGGTTTTTATAATATTCATACAACA 

541 + + + + + + 600 

CGTCTTATTTACAAAATGTTGAGGACTAAGGGCGTACCAAAAATATTATAAGTATGTTGT
AE*MFYNS*FPHGFYNIHTT

AAGAGGATTAGACAGTAAGAGTTTACAAGAAATAAATCTATATTTTTGTGAAGGGTAGTG 

601 + + + + + + 660 

TTCTCCTAATCTGTCATTCTCAAATGTTCTTTATTTAGATATAAAAACACTTCCCATCAC 
KRIRQ*EFTRNKSIFL*RVV 

GTATTATACTGTAGATTTCAGTAGTTTCTAAGTCTGTTATTGTTTTGTTAACAATGGCAG 

CATAATATGACATCTAAAGTCATCAAAGATTCAGACAATAACAAAACAATTGTTACCGTC 
VLYCRFQ*FLSLLLFC*QWQ 




Figure 2 (cont'd) 



GTTTTACACGTCTATGCAATTGTACAAAAAAGTTATAAGAAAACTACATGTAAAATCTTG 

CAAAATGTGCAGATACGTTAACATGTTTTTTCAATATTCTTTTGATGTACATTTTAGAAC
VLHVYAIVQKSYKKTTCKIL 

ATAGCTAAATAACTTGCCATTTCTTTATATGGAACGCATTTTGGGTTGTTTAAAAATTTA 

781 + + + + + + 840 

TATCGATTTATTGAACGGTAAAGAAATATACCTTGCGTAAAACCCAACAAATTTTTAAAT 
IAK*LAISLYGTHFGLFKNL 

TAACAGTTATAAAGAAAGAATTATAAAGGAAAAAGAAAATAACGCAATGGACAAGTGGTG 

841 + + + + + + 900

ATTGTCAATATTTCTTTCTTAATATTTCCTTTTTCTTTTATTGCGTTACCTGTTCACCAC 
*QL*RKNYKGKRK*RNGQVV 

AAGCTGTGAACTCAGGTGTGCACAATTATCAGGAACACCCCAAAACCAAAGTGAGGTAGA 

901 + + + + + + 960 

TTCGACACTTGAGTCCACACGTGTTAATAGTCCTTGTGGGGTTTTGGTTTCACTCCATCT 
KL*TQVCTIIRNTPKPK*GR 

AATAGCATGAGAAGCCGTGTTTGATGTTAATTAATT 

961 + + + 996 

TTATCGTACTCTTCGGCACAAACTACAATTAATTAA 
NSMRSRV*C*LI 




Figure 2 (cont'd) 





M 13 14 15 16 17 18 19 20 21 23 2 4 M 



Figure 3A 
Figure 3B
Figure 5



TAGTTTCCTATTCAATGTATAGTGCACCAAAGGTCAATTCAAGAGTTTATTATTATT

ATCAAAGGATAAGTTACATATCACGTGGTTTCCAGTTAAGTTCTCAAATAATAATAA
*FPIQCIVHQRSIQEFIII 



-180 



ATTTTCAACCCAAGTAAAAGCAGAGAGAAAATAGCCACCTCCACCATAGCCTCAGAAGCA

TAAAAGTTGGGTTCATTTTCGTCTCTCTTTTATCGGTGGAGGTGGTATCGGAGTCTTCGT 
IFNPSKSREKIATSTIASEA 



-120 



AGCCAACAGCCTGAAACAGCTTTGAAATGAAAAGTTGGTGTGGCGGTGATGGTGGCAGTG 

-119 + + + + + + -60 

TCGGTTGTCGGACTTTGTCGAAACTTTACTTTTCAACCACACCGCCACTACCACCGTCAC 
SQQPETALK*KVGVAVMVAV



ATAATGGTGACCGATGGTTGGGTGCTGGTGATGGTAGTGGTAGTTGTGAAGGTGGTGATG 

TATTACCACTGGCTACCAACCCACGACCACTACCATCACCATCAACACTTCCACCACTAC 
IMVTDGWVLVMVVVVVKVVM 



GTGGTTTGATTGATAGTAAAAAAAATGTTCGTTAATACAAGTAGAGAGTAAGTAATCAAT

CACCAAACTAACTATCATTTTTTTTACAAGCAATTATGTTCATCTCTCATTCATTAGTTA 
VV*LIVKKMFVNTSRE*VIN 



CAATCACTCATAGCCAAGGTGGAAAAGATGTATCCCATCATGGAATATTCCTGTTCTGAT 

GTTAGTGAGTATCGGTTCCACCTTTTCTACATAGGGTAGTACCTTATAAGGACAAGACTA 
QSLIAKVEKMYPIMEYSCSD 

AGAAATCTTGTGCTTATCTATGGAATTCTTTTGATATATATTTACATTGGGAACCTGAAT 

121 + + + + + + 180 

TCTTTAGAACACGAATAGATACCTTAAGAAAACTATATATAAATGTAACCCTTGGACTTA 
RNLVLIYGILLIYIYIGNLN 



GTAGCTTGACATTTTTCCATGTAAACACCAGTAGCCTGATCCAACATTAAGCTGATACTA 

181 + + + + + + 240

CATCGAACTGTAAAAAGGTACATTTGTGGTCATCGGACTAGGTTGTAATTCGACTATGAT 
VA*HFSM*TPVA*SNIKLIL 



ACAAACAACGTGTAATGGCTTCATTAATAAGGCTTTGCTTCTTCCTGGAAACTGGTGAAA 

TGTTTGTTGCACATTACCGAAGTAATTATTCCGAAACGAAGAAGGACCTTTGACCACTTT 
TNNV*WLH**GFASSWKLVK



AATCAAACCTTGTTGTGTACACCCTCGATGCAGCTTCTGTGTTGTCTTCACCCAGAAATG 

301 + + + + + + 360 

TTAGTTTGGAACAACACATGTGGGAGCTACGTCGAAGACACAACAGAAGTGGGTCTTTAC 
NQTLLCTPSMQLLCCLHPEM



Figure 6 (seq id no: sd 



GGGAATGATTTCCCAAATGGCAAAGAAACAGAGTGATGCTATCTATCTGCACCTTTTGTA 

CCCTTACTAAAGGGTTTACCGTTTCTTTGTCTCACTACGATAGATAGACGTGGAAAACAT 
GNDFPNGKETE*CYLSAPFV 

begin exon 79 



420 



AAGTCTGTCTTTCTTTCTCTTTGTTTTCC AGGA CACAATGTAGGAAGTCTTTTCCACATG 

TTCAGACAGAAAGAAAGAGAAACAAAAGGTCCTGTGTTACATCCTTCAGAAAAGGTGTAC 
KSVFLSLCFPGHNVGSLFHM 



480 



GCAGATGATTTGGGCAGAGCGATGGAGTCCTTAGTATCAGTCATGACAGATGAAGAAGGA 

481 + + + + + + 540 

CGTCTACTAAACCCGTCTCGCTACCTCAGGAATCATAGTCAGTACTGTCTACTTCTTCCT 
ADDLGRAMESLVSVMTDEEG 



GCAGAATAAATGTTTTACAACTCCTGATTCCCGCATGGTTTTTATAATATTCATACAACA 
541 + + + + + + 600

CGTCTTATTTACAAAATGTTGAGGACTAAGGGCGTACCAAAAATATTATAAGTATGTTGT 
AE*MFYNS*FPHGFYNIHTT 



( N ) 

AAGAGGATTAGACAGTAAGAGTTTACAAGAAATAAATCTATATTTTTGTGAAGGGTAGTG 

601 + + + + + + 660 

TTCTCCTAATCTGTCATTCTCAAATGTTCTTTATTTAGATATAAAAACACTTCCCATCAC 
KRIRQ*EFTRNKSIFL*RVV



GTATTATACTGTAGATTTCAGTAGTTTCTAAGTCTGTTATTGTTTTGTTAACAATGGCAG 

661 + + + + + + 720 

CATAATATGACATCTAAAGTCATCAAAGATTCAGACAATAACAAAACAATTGTTACCGTC 
VLYCRFQ*FLSLLLFC*QWQ 



GTTTTACACGTCTATGCAATTGTACAAAAAAGTTATAAGAAAACTACATGTAAAATCTTG 

CAAAATGTGCAGATACGTTAACATGTTTTTTCAATATTCTTTTGATGTACATTTTAGAAC 
VLHVYAIVQKSYKKTTCKIL



ATAGCTAAATAACTTGCCATTTCTTTATATGGAACGCATTTTGGGTTGTTTAAAAATTTA 

781 + + + + + + 840 

TATCGATTTATTGAACGGTAAAGAAATATACCTTGCGTAAAACCCAACAAATTTTTAAAT 
IAK*LAISLYGTHFGLFKNL 
inversion start site 
I 

TAACAGTTATAAAGAAAGAATTATAAAGGAAAAAGAAAATAACGCAATGGACAAGTGGTG

841 + + + + + + 900 

ATTGTCAATATTTCTTTCTTAATATTTCCTTTTTCTTTTATTGCGTTACCTGTTCACCAC 
*QL*RKNYKGKRK*RNGQVV 



Figure 6 (cont'd) 



AAGCTGTGAACTCAGGTGTGCACAATTATCAGGAACACCCCAAAACCAAAGTGAGGTAGA 

TTCGACACTTGAGTCCACACGTGTTAATAGTCCTTGTGGGGTTTTGGTTTCACTCCATCT 
KL*TQVCTIIRNTPKPK*GR 



960 



AATAGCATGAGAAGCCGTGTTTGATGTT AATTAA TT 

961 + + + 996 

TTATCGTACTCTTCGGCACAAACTACAATTAATTAA 
NSMRSRV*C*LI 



Figure 6 (cont'd)



•33 



451bp 



apo-4cDNA exon79 of dystrophin 



1.62 Kb deletion .657 Kb deletion 




860bp ; 



137bp 



inversion 



860bp 



genomic DNA exon 79 of dystrophin 



2.414Kb 
to end of 
dystrophin 
3.265Kb of 
3'UTR 



Figure 7 





4.1Kb 
2.9Kb 



A. Human B. Mouse 



Figure 10A



Figure 10B




1 50 

(SEQ ID NO 54) Mgenl073

(SEQ ID NO 55) Hapol234 ctagtttcct attcaatgta tagtgcacca aaggtcaatt caagagttta
Consensus 

51 100 

Mgenl073 

Hapol234 ttattattat tttcaaccca agtaaaagca gagagaaaat agccacctcc 
Consensus 

101 begin GRAIL exon @149 150 

Mgenl073 ttcACAGgCT tAAgCAGCca gtAAATGAcA 

Hapol234 accatagcct cagaagcaag ccaACAGcCT gAAaCAGCtt tgAAATGAaA 
Consensus ACAG-CT -AA-CAGC-- — AAATGA-A 

151 200 

Mgenl073 AtT T AtgtGgtAgt cAgGtcactG 

Hapol234 AgTtggtgtg gcggtgatgg tggcagtgaT AatgGtgAcc gAtGgttggG 

Consensus A-T T A G — A — -A-G G 



201 apo-4 5'end 250 

Mgenl073 TGCTGGTaAT GGTgaTctTA GcaGgcAgAG aaGGTGgTaG TGaTTTGATa 
Hapol234 TGCTGGTgAT GGTagTggTA GttGtgA.AG gtGGTGaTg G TG gTTTGATt 
Consensus TGCTGGT-AT GGT — T — TA G — G — A- AG — GGTG-T-G TG-TTTGAT- 

251 M1 300

Mgenl073 GtaAaagtgt AgAcTaTaCa acAgaAtAAa TAcAagtatA GTAA 

Hapol234 GatAgtaaaa Aa AaTg TtCg ttAatAcAAg TAgAgagtaA GTAAtcaatc 
Consensus G — A A-A-T-T-C- — A--A~AA- TA-A A GTAA 

301 M2 M3 350 

Mgenl073 ate caaCAAaGTG tgAAAGgTGT gTgCCATtAc acAtctTTCt 

Hapol234 aatcactcat agcCAAgGTG gaAAAG aTG T aTcCCATc At g gAataTTCc 
Consensus CAA-GTG — AAAG-TGT -T-CCAT-A- --A TTC- 

351 400 

Mgenl073 cG GtgATaagag cCTTgTCTAT GaAgTTC... TGAgATgTgT 

Hapol234 tGttctgata GaaATcttgt gCTTaTCTAT GgAaTTCttt TGAtATaTaT 

Consensus -G G--AT -CTT-TCTAT G-A-TTC TGA-AT-T-T 

401 450 

Mgenl073 TaggAagatG AAtCatcAat TtaCaT TTcTcCCcat cAAAtgaCAc 

Hapol234 TtacAttggG AAcCtgaAtg TagCtTgaca TTtTtCCatg tAAAcacCAg 

Consensus T A G AA-C A — T — C-T TT-T-CC -AAA CA- 

451 begin mouse GRAIL exon 500 

Mgenl073 cAtgCTGATC CAgtATTAAG CTaATACTAA C ACca tgcAatGCTT 

Hapol234 tAgcCTGATC CAacATTAAG CTgATACTAA CaaacaACgt gtaAtgGCTT 
Consensus -A--CTGATC CA— ATTAAG CT-ATACTAA C AC — A— GCTT 



501 550 
Mgenl073 CATTAAcAAG GaTTTGCTTC TTgCTaGAAA tgGGT. .AAA AaCggACtgT 
Hapol234 CATTAAtAAG GcTTTGCTTC TTcCTgGAAA ctGGTgaAAA AtCaaACctT 
Consensus CATTAA-AAG G-TTTGCTTC TT-CT-GAAA — GGT — AAA A-C — AC — T 



Figure 11 



-70 bp from 5' end of apo-4

Inr - GCCC TCAT TCTG GAGAC (SEQ ID NO: 59) 
apo-4 = GCGG TGAT GGTG GCAGT (SEQ ID NO: 60) 

- 48% perfect homology with Inr 
71% match on type of base 
(purine vs. pyrimidine) 



Figure 12B 
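The comparison in Figure 12B rests on two counts over a pair of equal-length sequences: positions with the identical base, and positions where both bases fall in the same class (purine A/G or pyrimidine C/T). A minimal sketch of that arithmetic follows; the function name `compare` is illustrative, and because the two sequences are taken from this transcription of a scanned figure, the percentages it yields need not equal the printed values exactly.

```python
PURINES = set("AG")  # A and G are purines; C and T are pyrimidines


def compare(a: str, b: str) -> tuple[int, int, int]:
    """Return (identical positions, same-class positions, length)."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    identical = sum(x == y for x, y in zip(a, b))
    same_class = sum((x in PURINES) == (y in PURINES) for x, y in zip(a, b))
    return identical, same_class, len(a)


# Sequences as transcribed in Figure 12B, with the spacing removed.
INR = "GCCCTCATTCTGGAGAC"   # SEQ ID NO: 59
APO4 = "GCGGTGATGGTGGCAGT"  # SEQ ID NO: 60

identical, same_class, length = compare(INR, APO4)
print(f"identity: {100 * identical / length:.0f}%")
print(f"same base class: {100 * same_class / length:.0f}%")
```

An identical base always counts as a same-class match too, so the second figure can never be lower than the first.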



          551                                                600
Mgenl073  GgTcTGTAtA CCtTCaATGC AGCTTaTGTG TTGTCTTttC C.tgAAatG
Hapol234  GtTgTGTAcA CCcTCgATGC AGCTTcTGTG TTGTCTTcaC CcagaAAtgG
Consensus G-T-TGTA-A CC-TC-ATGC AGCTT-TGTG TTGTCTT--C C----AA--G

          601                                                650
Mgenl073  GtAATGAcTc CCaAtAgtGg cAAccAgggG tacaATaCT TGCA
Hapol234  GgAATGAtTt CCcAaAtgGc aAAgaAacaG agtgATgCTa tctatcTGCA
Consensus G-AATGA-T- CC-A-A--G- -AA--A---G ----AT-CT- ------TGCA

          651                                     exon79     700
Mgenl073  CacTTTGTAA A....cTCTT TCTTTCTCTT TGTTTTCCAG GACACAATGT
Hapol234  CctTTTGTAA AgtctgTCTT TCTTTCTCTT TGTTTTCCAG GACACAATGT
Consensus C--TTTGTAA A-----TCTT TCTTTCTCTT TGTTTTCCAG GACACAATGT

          701                                                750
Mgenl073  AGGAAGcCTT TTCCACATGG CAGATGATTT GGGCAGAGCG ATGGAGTCCT
Hapol234  AGGAAGtCTT TTCCACATGG CAGATGATTT GGGCAGAGCG ATGGAGTCCT
Consensus AGGAAG-CTT TTCCACATGG CAGATGATTT GGGCAGAGCG ATGGAGTCCT

          751                                                800
Mgenl073  TAGTtTCAGT CATGACAGAT GAAGAAGGAG CAGAATAAAT GTTTTACAAC
Hapol234  TAGTaTCAGT CATGACAGAT GAAGAAGGAG CAGAATAAAT GTTTTACAAC
Consensus TAGT-TCAGT CATGACAGAT GAAGAAGGAG CAGAATAAAT GTTTTACAAC

          801                                                850
Mgenl073  TCCTGATTCC CGCATGGTTT TTATAATATT CgTACAACAA AGAGGATTAG
Hapol234  TCCTGATTCC CGCATGGTTT TTATAATATT CaTACAACAA AGAGGATTAG
Consensus TCCTGATTCC CGCATGGTTT TTATAATATT C-TACAACAA AGAGGATTAG

          851                                                900
Mgenl073  ACAGTAAGAG TTTACAAGAA ATaAAATCTA TATTTTTGTG AAGGGTAGTG
Hapol234  ACAGTAAGAG TTTACAAGAA AT.AAATCTA TATTTTTGTG AAGGGTAGTG
Consensus ACAGTAAGAG TTTACAAGAA AT-AAATCTA TATTTTTGTG AAGGGTAGTG

          901                                                950
Mgenl073  GTAcTATACT GTAGATTTCA GTAGTTTCTA AGTCTGTTAT TGTTTTGTTA
Hapol234  GTAtTATACT GTAGATTTCA GTAGTTTCTA AGTCTGTTAT TGTTTTGTTA
Consensus GTA-TATACT GTAGATTTCA GTAGTTTCTA AGTCTGTTAT TGTTTTGTTA

          951                                               1000
Mgenl073  ACAATGGCAG GTTTTACACG TCTATGCAAT TGTACAAAAA AGTTAaAAGA
Hapol234  ACAATGGCAG GTTTTACACG TCTATGCAAT TGTACAAAAA AGTTAtAAGA
Consensus ACAATGGCAG GTTTTACACG TCTATGCAAT TGTACAAAAA AGTTA-AAGA

          1001                                              1050
Mgenl073  AA...ACATG TAAAATCTTG ATAGCTAAAT AACTTGCCAT TTCTTTATAT
Hapol234  AAactACATG TAAAATCTTG ATAGCTAAAT AACTTGCCAT TTCTTTATAT
Consensus AA---ACATG TAAAATCTTG ATAGCTAAAT AACTTGCCAT TTCTTTATAT



Figure 11 (cont'd) 



begin inversion@1100 
1051 1100 
Mgenl073 GGAACGCATT TTGGGTTGTT TAAAAATTTA TAACAGTTAT AAAGAAAGAt 
Hapol234 GGAACGCATT TTGGGTTGTT TAAAAATTTA TAACAGTTAT AAAGAAAGAa 
Consensus GGAACGCATT TTGGGTTGTT TAAAAATTTA TAACAGTTAT AAAGAAAGA- 

1101 1150 
Mgenl073 TgtaAActaA Agtgtgcttt AtAAAAaAAg ttgtTtataA AaacccctAa 

Hapol234 TtatAAaggA A aa AgAAAAtAAc gcaaTggacA AgtggtgaAg 

Consensus T AA A A A-AAAA-AA- T A A A- 

1151 1200 
Mgenl073 acaaacACaC AcGcacaCAC AcacAcacac AcacaCaCAc AcaCAcAcTG 
Hapol234 ctgtgaACtC AgGtgtgCAC AattAtcagg AacacCcCAa AacCAaAgTG 
Consensus AC-C A-G CAC A A A C-CA- A— CA-A-TG 

1201 1243 
Mgenl073 AGGcAGcAca ttgtTttGcA ttacTtTagc gTGTatcaTA t.. 
Hapol234 AGGtAGaAat agcaTgaGaA gccgTgTttg aTGTtaatTA att 
Consensus AGG-AG-A — T — G-A T-T -TGT TA 



Figure 11 (cont'd) 




Figure 12A 




Figure 13 




Figure 14 



exon 79 inversion 
@451 @860 



Figure 15 




274 bp -I 

inversion 
product 




H 274 bp 

* inversion 
product 
Figure 17A
12/23bp spacer

CACAGTG ACAAAAACC 

heptamer nonamer 



Figure 18A 
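The motif layout in Figure 18A (a heptamer, then a spacer of 12 or 23 bp, then a nonamer) lends itself to a simple pattern search. The sketch below is only an illustration under strong assumptions: it matches the exact heptamer and nonamer shown in the figure, scans a single strand, and allows no mismatches, whereas sites in real sequence are often degenerate. The function name `find_signal_sites` is hypothetical.

```python
import re

HEPTAMER = "CACAGTG"    # as printed in Figure 18A
NONAMER = "ACAAAAACC"   # as printed in Figure 18A

# Lookahead so that overlapping candidates are all reported.
_PATTERN = re.compile(
    f"(?=({HEPTAMER})([ACGT]{{12}}|[ACGT]{{23}})({NONAMER}))"
)


def find_signal_sites(seq: str) -> list[tuple[int, int]]:
    """Return (0-based heptamer start, spacer length) for every exact match."""
    return [
        (m.start(1), len(m.group(2)))
        for m in _PATTERN.finditer(seq.upper())
    ]


# Synthetic demonstration sequence, not taken from the figures.
demo = (
    "TT" + HEPTAMER + "A" * 12 + NONAMER
    + "GG" + HEPTAMER + "C" * 23 + NONAMER
)
print(find_signal_sites(demo))
```

To search the opposite strand as well, the same function can be run on the reverse complement of the input.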

B. 

inversion breakpoint i 

11640 11650 11660 | 11670 11680 

* * * I * * 

dystrophin T TTATAACAGT TAT AAAGAAA GA ^TTGTAAAC TAAAGTGTGC 
A AATATTGTCA ATATTTCTTT CT^AACATTTG ATTTCACACG 

a 

apo-4 cDNA 840 850 | 870 

[ 13 8 ] T TTATAACAGT TAT AAAGAAA GA 'TTaTAAAq gAAAaaGaaa> 
^ ^^^^^^^^^^ ^^^^^^^^^^ ^^^^^^^^ 

dystrophin T TTATAACAGT TATAAAGAAA GA^TTGTAAAC TAAAGTGTGC 



11690 11700 11710 11720 11730 

* * * * *

dystrophin TTTATAAAAA AAAGTTGTTT ATAAAAACCC CTAAAAAC AA AACAAACACA 
AAATATTTTT TTTCAACAAA TATTTTTGGG GATTTTTGTT TTGTTTGTGT 
apo-4 cDNA 880 890 900 910 920 930 

[ 13 8 ] aTaAaAtggA cAAGTgGTga ATgtgAACtC aggtgtgCAc AAttAtCAgg> 

V V V WV V V W VW V WWVW V W V w 

dystrophin TTTATAAAAA AAAGTTGTTT ATAAAAACCC CTAAAAACAA AACAAACACA 



11740 11750 
* *

dystrophin CACACACACA CA TACACACA 
GTGTGTGTGT GTATGTGTGT 
apo-4 cDNA 940 950 
[ 138 ] aACAC-CcCA -AaAC-CAaA> 
^^^^^ ^^^^ ^v^^ ^'^v^ 

dystrophin CACACACACA CATACACACA 



Figure 18B (seq id no: 57) 






13130 
* 



inversion breakpoint2 

13140 13150 13160 | 13170 

* * * | *



dystrophin AATTAGCTTT TGGAGAGTGG GTTTTGTCCA TTATTAATAA TTAATTAATT
TTAATCGAAA ACCTCTCACC CAAAACAGGT AATAATTATT AATTAATTAA 



apo-4 

dystrophin 



990 

_<AATTAATT 
AATTAATT 



13180 13190 13200 13210 13220 

***** 

dystrophin AACATCAAAC ACGGCTTCTC ATGCTATTTC TACCTCACTT TGGTTTTGGG 
TTGTAGTTTG TGCCGAAGAG TACGATAAAG ATGGAGTGAA ACCAAAACCC

980 970 960 950 940 

apo-4 <AACATCAAAC ACGGCTTCTC ATGCTATTTC TACCTCACTT TGGTTTTGGG 



dystrophin AACATCAAAC ACGGCTTCTC ATGCTATTTC TACCTCACTT TGGTTTTGGG 



13230 13240 13250 13260 13270 

***** 

dystrophin GTGTTCCTGA TAATTGTGCA CACCTGAGTT CACAGCTTCA CCACTTGTCC 
CACAAGGACT ATTAACACGT GTGGACTCAA GTGTCGAAGT GGTGAACAGG 

930 920 910 900 890 

apo-4 <GTGTTCCTGA TAATTGTGCA CACCTGAGTT CACAGCTTCA CCACTTGTCC 



dystrophin GTGTTCCTGA TAATTGTGCA CACCTGAGTT CACAGCTTCA CCACTTGTCC 



13280 13290 13300 13310 13320 

***** 

dystrophin ATTGCGTTAT TTTCTTTTTC CTTTATAATT CTTTCTTTTT CCTTCATAAT
TAACGCAATA AAAGAAAAAG GAAATATTAA GAAAGAAAAA GGAAGTATTA

inversion breakpoint3 

880 870 860 850 840 

apo-4 <ATTGCGTTAT TTTCTTTTTC CTTTATAAT T CTTTCTTT aT aacTgtTAta 

^^^^^^^^^^ ^^^^^^^^^ 

dystrophin ATTGCGTTAT TTTCTTTTTC CTTTATAATT CTTTCTTTTT CCTTCATAAT 



Figure 18C (seq id no: 58)
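In Figure 18C the apo-4 numbering runs backwards against the dystrophin genomic numbering, which is what an inverted segment looks like: the cDNA reads as the reverse complement of the genomic top strand. A minimal sketch of that check follows, using one stretch of the genomic row and the matching stretch of the cDNA from Figure 2; the function name `reverse_complement` and the choice of stretch are illustrative.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Complement every base, then reverse the order."""
    return seq.translate(_COMPLEMENT)[::-1]


# First 28 bases of the dystrophin row that begins AACATCAAAC in Figure 18C.
GENOMIC = "AACATCAAACACGGCTTCTCATGCTATT"
# First 28 bases of the apo-4 cDNA row numbered 961 onward in Figure 2.
CDNA = "AATAGCATGAGAAGCCGTGTTTGATGTT"

print(reverse_complement(GENOMIC) == CDNA)
```

Applying the function twice returns the original sequence, which is a quick sanity check on any hand-copied stretch.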






inversion breakpoint 
11654 I 

-AAAGAAAGAT-
 |||||||||
-TTTCTTTCTT-
13299 



Figure 18D (Sequence included in SEQ ID NO: 57)



inversion @ 860 
TAACAGTTATAAAGAAAGAATTATAAAGGAAAAAGAAAATAACGCAATGGACAAGTGGTG
841 + + + + + + 900

ATTGTCAATATTTCTTTCTTAATATTTCCTTTTTCTTTTATTGCGTTACCTGTTCACCAC



Figure 19 (Sequence included in SEQ ID NO: 2) 
Figure 20A



Figure 20B 




Figure 22 




Figure 23 



1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 




K562 Apo-4 TF Sham 



Figure 24 




Figure 25B 



H2 starting at second methionine - 321 bp, predicted weight = 17.4 Kd + 1
N-glycosylation site + 20.4 Kd.

Figure 26A 




Splice sites for peptide 



MYPIMEYSCSD RNLVLIYGILLIYIYIGNLNMK KEQNKCFTTPDSRMVFIIFI



Figure 26B 



Start   Exon No.  Exon Position  Exon Length  Intron No.  Intron Position  Intron Length
@88bp   78.3      @74-180        106 bp       79.1        @181-529         349 bp
        79.1      @530-654       125 bp       79.4        @655-720         66 bp
        79.4      @721-769       49 bp        79.55       @770-875         105 bp
        79.55     @876-893       18 bp        79.75       @894-932         39 bp
        79.85     @933-966       33 bp









Hydrophobicity Scale KD; Candidate membrane-spanning segments: 
Certain 1 12- 32 1.8833 



Figure 26C 
A readthrough apo-4S product using the second available methionine



The Apo-4S peptide sequence 

P1 Begin TM1(R)

+30 | P2

MYPIMEYSCSD RNLVLIYGIL LIYIYIGNLN V ARHFSMKTP VARSNIKLIL 80

TNNVKWLHKK GFASSWKLVK NQTLLCTPSM QLLCCLHPEM GNDFPNGKET 130

P3

E RCYLSAPFV KSVFLSLCFP GHN VGSLFHM A DDLGRAMES LVSVMTDEE G 180

AEKMFYNSRF PHGFYNIHTT KRIRQKEFTR NKSIFLR RVV VLYCRFQKFL 230

SLLLFCKQ WQ VLHVYAIVQK SYKK TTCKIL IAKKLAISLY GTHFG LFKNL 280

KQLKRKNYKG KRKKRNGQVV KLRTQVCTII RNTPKPKRGR NSMRSRVRCK 330

LI 332 (302aa in predicted polypeptide) (SEQ ID NO: 56)

Figure 27A 



Candidate membrane-spanning segments: 



Certain   1   41-61     1.9073
Putative  2   101-121   0.8052
Certain   3   132-152   1.2552
Putative  4   217-237   1.1833
Putative  5   254-274   0.9240



Transmembrane segments included in structure No. 8: 1 2 3 4 5 

Loop lengths: 11 39 10 64 16 58; K+R profile: 1 2 5(9>22) 

K+R difference: -23: -> Orientation: N-out 

Charge-difference over N-terminal Membr. segs, (±15 residues): -4 
-> Orientation: N-out 

CYT-EXT profile (neg. values indicate cytoplasmic preference): < -0.13 < 
CYT-EXT difference: 0.13: -> Orientation: N-out 



Figure 27B 
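The candidate membrane-spanning segments above are scored on the Kyte-Doolittle (KD) hydropathy scale. As a rough illustration of how such a profile is built, the sketch below averages KD values over a sliding window of 21 residues, the segment length used in the tables. This plain average is an assumption made for simplicity; it is not the weighting used to produce the scores in the figure, so its numbers will not reproduce the listed values. The function name `hydropathy_profile` is illustrative.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(seq: str, window: int = 21) -> list[float]:
    """Mean KD value of each full window; entry i covers seq[i:i + window]."""
    if window < 1 or window > len(seq):
        return []
    values = [KD[residue] for residue in seq]
    return [
        sum(values[i:i + window]) / window
        for i in range(len(values) - window + 1)
    ]


# First 50 residues of the Apo-4S peptide in Figure 27A, spacing removed.
peptide = "MYPIMEYSCSDRNLVLIYGILLIYIYIGNLNVARHFSMKTPVARSNIKLI"
profile = hydropathy_profile(peptide)
best = max(range(len(profile)), key=profile.__getitem__)
print(f"most hydrophobic window starts at residue {best + 1}")
```

Windows with a high mean are candidates for membrane-spanning segments; a cutoff would still have to be chosen to label them, and that choice is outside this sketch.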
Figure 28




Figure 28 (cont'd) 




Figure 29 




Figure 29 (cont'd)




Figure 30 
Figure 31




Figure 31 (cont'd) 
Additional Oligonucleotide primers used for apo-dystrophin-4
Southern blotting and sequencing



FORWARD

F2.3(@28)    GTT CGT TAA TAC AAG TAG          (SEQ ID NO 15)
F2.2(@73)    GCC AAG GTG GAA AAG ATG          (SEQ ID NO 16)
F3.2(@208)   CCA GTA GCC TGA TCC AAC          (SEQ ID NO 17)
F3.1(@257)   GGC TTC ATT AAT AAG              (SEQ ID NO 18)
F4.2(@379)   GGC AAA GAA ACA GAG TG           (SEQ ID NO 19)
F4.1(@449)   CAG GAC ACA ATG TAG GA           (SEQ ID NO 20)
Fjn(@846)    GTT ATA AAG AAA GAA TTA TAA AG   (SEQ ID NO 21)
F5.1(@875)   GAA AAT AAC GCA ATG GAC          (SEQ ID NO 22)

REVERSE

R6.1(@99)    GAT GGG ATA CAT CTT TTC C        (SEQ ID NO 23)
F2.2R(@188)  CAA GCT ACA TTC AGG TTC CC       (SEQ ID NO 24)
R4.1(@510)   GGA CTC CAT CGC TCT GCC          (SEQ ID NO 25)
R3.4(@694)   GAC TTA GAA ACT ACT G            (SEQ ID NO 26)
R2.1(@735)   ATA GAC GTG TAA AAC CTG C        (SEQ ID NO 27)
RSP2(@848)   AAC TGT TAT AAA TTT TTA          (SEQ ID NO 28)
R2 3o(@875)  CTT TTT CCT TTA TAA TTC TTT C    (SEQ ID NO 29)



Figure 34 
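Each primer label in Figure 34 carries an @ position on the apo-4 sequence. A minimal sketch of how such a position can be recovered from the sequence follows: a forward primer is searched for directly, while a reverse primer is searched for as its reverse complement and reported at the template coordinate of its 5' end. The function name `locate_primer` is hypothetical, and the template is the Figure 6 row assumed here to span positions 61-120 (inferred from the 121-180 ruler of the row that follows it).

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    return seq.translate(_COMPLEMENT)[::-1]


def locate_primer(template: str, primer: str, first_pos: int = 1):
    """Return (strand, position of the primer's 5' end) or None.

    `first_pos` is the coordinate of the first base of `template`.
    """
    primer = primer.replace(" ", "").upper()
    i = template.find(primer)
    if i != -1:
        return "+", first_pos + i
    i = template.find(reverse_complement(primer))
    if i != -1:
        # The primer's 5' end pairs with the last base of the matched site.
        return "-", first_pos + i + len(primer) - 1
    return None


# Figure 6 row assumed to cover positions 61-120.
ROW_61_120 = "CAATCACTCATAGCCAAGGTGGAAAAGATGTATCCCATCATGGAATATTCCTGTTCTGAT"

print(locate_primer(ROW_61_120, "GCC AAG GTG GAA AAG ATG", first_pos=61))     # F2.2
print(locate_primer(ROW_61_120, "GAT GGG ATA CAT CTT TTC C", first_pos=61))  # R6.1
```

With this row the two results agree with the @73 and @99 annotations of primers F2.2 and R6.1. Only exact matches are found, so a primer containing a transcription error would return None.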



An Additional Splice Product Predicted From The Apo-4 Gene 

A second potential theoretical splice product which retains exon 78.3 is shown below. 

H2 pl-124 spliced product =351 bp, 117 amino acids + 10 from vector + 1 N-glycosylation 
site; predicted weight = 21.9 Kd 

Figure 35A 

Peptide Generated 

MFWTTKVEKMYPIMEYSCSD RNLVLIYGILLIYIYIGNLNM KKEQNKCFTTPDSRMVFII
FIQQRGLDSKSLQEINLYFCEGFYTSMQLYKKVIRKLHKITQWTRTPQNQSEVEIA (117
amino acids) (SEQ ID NO: 30)



Figure 35B 



Start   Exon No.  Exon Position  Exon Length  Intron No.  Intron Position  Intron Length
@26bp   78.1      @16-41         26 bp        78.3        @42-74           33 bp
        78.3      @75-181        106 bp       79.1        @182-530         349 bp
        79.1      @531-655       125 bp       79.4        @656-721         66 bp
        79.4      @722-770       49 bp        79.55       @771-876         105 bp
        79.55     @877-894       18 bp        79.75       @895-933         39 bp
        79.85     @934-967       33 bp









Hydrophobicity Scale KD; Candidate membrane-spanning segments: 
Certain 1 22- 42 1.8833 

Figure 35C 
Nucleic Acid Subsequence Sites Identified In Apo-4



Motif                                     Position              Significance

CpG                                       -7, (+28, +106)       DNA methylation site
CAAT                                      -132, (+127, +131)    Binding of CAAT factors
TATAAT (5/6)                              -120, -114, (+10)     TFim Binding site
TATA                                      -154                  Binds RNA polymerase II and TFIID
CCATTCA                                   -162, -131            Cap Site I
TATCAGT                                   +12, (+25)            Cap Site II
TGGCTGCAAGCCCAA (10/14) (SEQ ID NO: 32)   -57, (+41)            Binds CTF/NF-I protein
GTGATGG                                   -140, -4, +11, +32    Eucaryotic Transcription Initiation Site


Figure 36 




Top Pred predicts 4-5 transmembrane domains for a full-length 
apo-4F product in which all the stop codons are suppressed. 



Protein sequence and position of predicted TM domains 



Begin TM1 (R)



P1



P2 



MFVNTSREKV INQSLIAKVE K MYPIMEYSCSD RNLVLIYGIL LIYIYIGNLN VA RHFSMK 60

TPVARSNIKL ILTNNVKWLH KKGFASSWKL V KNQTLLCTP SMQLLCCLHP EMG NDFPNGK 120

P3

ETE RCYLSAP FVKSVFLSLC FPGH NVGSLF HMA DDLGRAM ESLVSVMTDE E GAEKMFYNS 180
RFPHGFYNIH TTKRIRQKEF TRNKSIFL RR VVVLYCRFQK FLSLLLFCK Q WQVLHVYAIV 240
QKSYK KTTCK ILIAKKLAIS LYGTHF GLFK NLKQLKRKNY KGKRKKRNGQ VVKLRTQVCT 300
IIRNTPKPKR GRNSMRSRVR CKLI (324 amino acids) (SEQ ID NO 31)

Hydrophobicity Scale KD 



Certain   1   33-53     1.9073
Putative  2   93-113    0.8052
Certain   3   124-144   1.2552
Putative  4   209-229   1.1833
Putative  5   246-266   0.9240



I. Transmembrane segments included in structure 8: 1 2 3 4 5; Loop lengths: 32 39 10 64
16 58



Figure 37A 



Apo-4F 



Candidate membrane-spanning segments: 



Figure 37B 



K+R difference: -19; -> Orientation: N-out; Charge-difference over N-terminal Membr. segs. 
(±15 residues): -3; -> Orientation: N-out 

CYT -EXT profile (neg. values indicate cytoplasmic preference): < < < < -0.13 < 

CYT-EXT difference: 0.13 
-> Orientation: N-out 

II. Transmembrane segments included in structure 7: 1 3 4 5; Loop lengths: 32 70 64 16 
58 

K+R profile: 5 > 22 > 5; K+R difference: 22 -> Orientation: N-in 
Charge-difference over N-terminal Membr. segs. (±15 residues): -3; -> Orientation: N-out 
CYT-EXT profile (neg. values indicate cytoplasmic preference): < -0.13 < -0.26 < 
CYT-EXT difference: 0.13; -> Orientation: N-out

Figure 37B (cont'd) 